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Abstract 

Delay constrained linear transmission (LT) strategies are considered for the transmission of com¬ 
posite Gaussian measurements over an additive white Gaussian noise fading channel under an average 
power constraint. If the channel state information (CSI) is known by both the encoder and decoder, the 
optimal LT scheme in terms of the average mean-square error distortion is characterized under a strict 
delay constraint, and a graphical interpretation of the optimal power allocation strategy is presented. 
Then, for general delay constraints, two LT strategies are proposed based on the solution to a particular 
multiple measurements-parallel channels scenario. It is shown that the distortion decreases as the delay 
constraint is relaxed, and when the delay constraint is completely removed, both strategies achieve the 
optimal performance under certain matching conditions. If the CSI is known only by the decoder, the 
optimal LT strategy is derived under a strict delay constraint. The extension for general delay constraints 
is shown to be hard. As a hrst step towards understanding the structure of the optimal scheme in this 
case, it is shown that for the multiple measurements-parallel channels scenario, any LT scheme that 
uses only a one-to-one linear mapping between measurements and channels is suboptimal in general. 

Index Terms 

Linear transmission, delay constraint, composite of Gaussians, fading channel, water hlling, wireless 
sensor networks. 
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I. Introduction 

Near real-time monitoring of a physieal phenomena is of great significanee to many emerging 
network applieations, such as monitoring of voltage, current magnitudes, active/reactive power 
values in smart grids (SGs) lUl, or temperature and humidity in forest fire detection networks [|3. 
To this end, wireless sensors are deployed throughout the physical network and the sensor 
measurements are delivered to a control center (CC) over wireless links. For the robust, reliable 
and efficient management of the underlying physical networks, near real-time and accurate 
reconstruction of the measurements at the CC becomes necessary. For example, in conventional 
state estimation for the electricity grid, measurements are collected once every two to four 
seconds and the state is updated once every few minutes (111. However, more frequent state 
measurements and estimations are required for modem SGs, which inevitably imposes strict 
delay constraints on the transmission of measurements. As a further example, in forest fire 
detection networks dH, measurements of smoke and gas sensors along with camera images are 
used to detect fire, and the delay inevitably becomes a major constraint for the transmission. 
Thus, zero-delay linear transmission (LT), rather than advanced compression and channel coding 
techniques that span large codewords, is an attractive strategy for the transmission of sensor 
measurements in intelligent networks. This is because LT reduces both the delay and encoding 
complexity significantly; and accordingly limits the cost and energy requirements of the sensors. 

LT of Gaussian sources has been extensively studied in the literature. Goblick showed in [[5l 
that zero-delay LT of a Gaussian source over an additive white Gaussian noise (AWGN) channel 
achieves the optimal mean-square error (MSE) distortion. In [|6]|, the optimal LT scheme that 
matches an r-dimensional Gaussian signal to a fc-dimensional AWGN vector channel is character¬ 
ized. It is shown that the optimal LT performance can be achieved by mapping ordered sources to 
ordered channels in a one-to-one fashion. LT of a Gaussian source over a fading AWGN channel 
is studied in [|71. It is shown that the optimal LT performance can be achieved by symbol-by¬ 
symbol processing, and increasing the block length does not provide any gain, as opposed to 
nonlinear coding schemes. In [[81, LT of noisy vector measurements over a fading AWGN channel 
is studied under diagonal and general observation matrices. LT of vector Gaussian sources over 
static and fading multi-antenna channels is studied in 10 and lITOl . respectively. 

We consider a wireless sensor node that collects measurements from J Gaussian parameters. 
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We discretize time into time slots (TSs), and assume that the CC asks for a measurement of a 
particular parameter from the sensor at each TS. The sensor takes one sample of the requested 
parameter at each TS, and transmits these samples to the CC over an AWGN fading channel 
under a given delay constraint. Note that, in contrast to multi-dimensional Gaussian source 
models studied in m, ma, ed, where the sensor has the measurements of all the J Gaussian 
parameters at the beginning of a TS, we assume that only one measurement is taken from the 
requested parameter at each TS. 

We assume that each measurement must be delivered within d TSs. Thereby, after each 
transmission, the CC estimates the measurement whose deadline is just about to expire. We 
assume that the channel gain from the sensor to the CC is independent and identically distributed 
(i.i.d.) over TSs. We consider two different scenarios regarding the channel state information 
(CSI); In the first scenario, the CSI is assumed to be available to both the encoder and decoder, 
while in the second scenario, only the decoder has CSI. Our goal is to estimate all the requested 
measurements at the CC within their delay constraints with the minimum MSE distortion. 

We focus explicitly on LT strategies. Assuming that the CSI is known by both the encoder 
and decoder, we first derive the optimal LT strategy under a strict delay constraint {d = 1), and 
show that the optimal power allocation and the corresponding distortion can be interpreted as 
water-filling reflected on a reciprocal mirror. Exploiting the results of |l6l, we also derive the 
optimal LT strategy under a strict delay constraint for a particular scenario in which the sensor 
transmits the measurement vector over parallel AWGN fading channels at each TS. Then, building 
on our previous works ifT^ . [fTSll . and exploiting the optimal LT strategy derived for multiple 
measurements-parallel channels scenario above, we propose two LT strategies for general delay 
constraints. In both strategies, measurements are first collected and stored in a buffer whose 
size depends on the delay constraint, and then, are transmitted to the CC over multiple channel 
accesses within the delay constraint. The two strategies consider different measurement selection 
criterias, which are used to select the appropriate stored measurement to be transmitted at each 
channel access. We then derive the theoretical lower bound (TLB) and the LT lower bound (LLB) 
on the achievable MSE distortion. We characterize the MSE distortion achieved by the proposed 
LT schemes, as well as the TLB and the LLB under various power and delay constraints. We 
show that the MSE distortion diminishes as the delay constraint is relaxed if the sensor is capable 
of measuring more than one system parameter, i.e., J > 1. However, if J = 1, then relaxing the 
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Fig. 1. The illustration of the transmission model from the perspective of the sensor with multiple channel accesses. 


delay constraint does not provide any improvement in LT performance as argued in 10. When the 
fading channel follows a discrete distribution and the delay constraint is completely removed, we 
show that the proposed LT strategies meet the TLB under certain matching conditions between 
the channel states and the paramater variances; and hence, achieve the optimal performance. 

When the CSI is known only by the decoder, we first derive the optimal LT strategy under a 
strict delay constraint. Then, we consider the multiple measurements-parallel channels scenario 
under a strict delay constraint and J > 1 assumption, and show that the optimal LT performance 
cannot be achieved by an LT scheme that is constrained to use only a one-to-one linear mapping 
between measurements and channels, as opposed to the J = 1 case |[71, and the CSI is known 
by both the encoder and decoder 0, respectively. Since the optimal LT strategy is elusive for 
J > 1, we do not consider LT strategies for larger delay constraints. Finally, we derive the TLB 
on the achievable MSE distortion. 

The rest of the paper is structured as follows. The system model is presented in Section HIl In 
Sections |nl] to |V] CSI is assumed at both the encoder and decoder. In Section Uni we study the 
optimal LT strategy under a strict delay constraint. Two LT strategies are proposed for general 
delay constraints in Section UVl In Section |Vl we characterize the TLB and LLB on the achievable 
MSE distortion. In Section IVH the optimal ET strategy is derived under a strict delay constraint 
along with the TEB, when the CSI is known only by the decoder. Section IVTII presents extensive 
numerical results, and finally. Section IVIIII concludes the paper. 

11. System Model 

We consider a CC that monitors the operation of a system through a wireless sensor (Eig. [T]), 
which is capable of measuring J distinct system parameters. The jth system parameter is 
modelled as a zero-mean Gaussian random variable (r.v.) with variance (j|, i.e., AA(0 ,(j|), for 
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j G where [1:J] denotes the set {1,2,, J}. These system parameters are independent 
from eaeh other, and their realizations are i.i.d. over time. In order to monitor the network 
operation, the CC requests the measurement of one system parameter from the sensor at eaeh 
TS. The index of the requested system parameter at eaeh TS is a r.v. denoted by M G [1: J], with 
distribution whieh is also i.i.d. over time. Based on these requests, the sensor takes one 

measurement of the requested parameter m at eaeh TS. Thereby, the model is that of a eomposite 
souree introdueed in Chapter 6 of llT4l . The souree S ean be deseribed as a eomposite souree 
eomprised of J distinet eomponents (subsourees), eaeh operating independently of the others. 
In our model, eaeh eomponent produees data aeeording to a Gaussian probability distribution 
P{-\m) = AA(0,(T^). The set G of all sourees eomprises the eomposite souree {5,,^}. In our 
ease, 

G= [AA(0,a2),AA(0,a2),...,AA(0,a})]. (1) 

The index of the requested system parameter m generates the sequenee of positions assumed 
by the switeh in Fig. [IJ In our model both the eneoder and the deeoder possess the exaet 
knowledge of this sequenee. Notiee that, in the partieular ease in whieh the eneoder and deeoder 
are uninformed about this sequenee, the eomposite souree {S'*, G} is equivalent to a mixture of 
Gaussian distributions, i.e., Ps{s) = J2m=i PM{m)Ps\M{s\m). 

We assume that the CC imposes a maximum delay eonstraint of d G Z+ on the measurements, 
that is, the measurement requested in a TS needs to be transmitted within the following d TSs; 
otherwise, it becomes stale. The collected sensor measurements are transmitted to the CC over 
a fading channel with zero-mean and unit variance AWGN. The channel output at TS i is given 
hyy. = h iXi -f- Zi, where Xi is the channel input, Zi is the additive noise with Z AA(0,1), 
and hi is the fading state of the channel. We consider a fading channel model, and assume that 
the fading coefficient ifj G M is modelled as a r.v. i.i.d. over time with probability distribution 
Pnih). 

We define m- = [mi,mi+i, as the sequence of indices of requested parameters at TSs 

[i:l] for i < 1. The measurement sequence is defined similarly as s- = [sj,..., si], where the i-\h 
entry s* is the measured value of the requested parameter at TS i. Therefore, the sequence s* 
has independent entries, where the i-th entry comes from a Gaussian distribution with variance 
cr^,. Note that in our composite Gaussian measurements model, conditioned on the requested 
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parameter index, whieh is known by both the eneoder and deeoder, the souree samples follow 
Gaussian distributions with different varianee values. 

The ehannel state and the output sequenees at TSs [r.l] are similarly defined as h- = [hi,hi] 
and y- = respeetively. We assume that both the eneoder and deeoder at TS i know all 

the past ehannel states, and the indiees of requested parameters, nip as well as the statisties 
of the measured parameters, cx^, the parameter requests, pMim), and the ehannel, puih). In the 
first part of the paper we assume that both the eneoder and deeoder know the eurrent ehannel 
state, hi. Note that this assumption might be hard to realize for a fast fading ehannel model; 
on the other hand, our system model ean be eonsidered as instanees of a slow fading ehannel. 
Typieally, there will be a large number of sensors in the system, and eaeh sensor is going to 
be seheduled only onee in a while; and henee, eaeh TS in our system model ean be eonsidered 
as one instanee of a slow fading ehannel when a partieular sensor is seheduled to transmit. 
Sinee these instanees are separated from eaeh other due to the transmission of other sensors, 
eorresponding ehannel states are modeled as i.i.d., and are assumed to be known by both the 
eneoder and deeoder, as ehannel estimation and CSI feedbaek ean be earned out between two 
transmissions of the same sensor. In Seetion we will eonsider the seenario in whieh the CSI 
is known only by the deeoder. 

1) Encoding Function: The eneoding funetion /j : M* x M® x M* —>■ M, maps hp and m®]^ 
to a ehannel input Xi at eaeh TS i, i.e., Xi = fi{s\, h®]^, m®^). An average power eonstraint of P 
is imposed on the eneoding funetion: 



where Em,h,s [•] denotes the expeetation over M, H and S. For any generie transmission poliey. 


the eneoding funetion /j, at TS i, may eonsider to use any eombination of s®p hp and to 


generate Xi. This gives rise to a time-varying eneoding seheme. 

2) Decoding Function: At the end of TS i, the goal of the CC is to estimate with the minimum 
MSB distortion, the measurement whieh has been requested exaetly d — 1 TSs ago, and 

is about to expire. The deeoding funetion pi : W x W x W ^ M, estimates Sj_d+i based on yp 
h®]^, and m®^, i.e., Sj-d+i = giiyl, ™i)- The MSB distortion is given by: 
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D = lim — 

n—>-oo 77, 


Em, 


H,S,Z 




i=d 

The decoding function gi, at TS i, reconstructs the measurement using and Hence, 

similarly to the encoder, the decoder may be time-varying. 

We are interested only in LT policies in which /j’s are restricted to be linear functions of the 
sensor measurements, s/s, i.e., ■ s^, where fi(h\,m\) is a vector. 

Under this linearity constraint, the optimal estimators at the receiver, ^/s, are also linear functions 
of the channel outputs, 7/j’s, i.e., = gi{h\, ■ y}, where gi{h.\, is a vector. 

Hereafter, we will refer to fi and gi for the encoding and decoding functions at TS i, respectively. 


HI. Strict Delay Constraint 


We first derive the optimal LT strategy under a strict delay constraint (d = 1), and characterize 
the minimum achievable MSB distortion. In this scenario, optimal LT performance is achieved 
by transmitting only the most recent measurement since all the previous measurements have 
expired, and transmitting an expired measurement cannot help the estimation of the current 
measurement since the measurements are independent. Then the encoding function fi{hi,Tni) at 
TS z is a scalar. Given the encoding function, the decoding function gi{hi,mi) that minimizes 
the MSB for Gaussian r.v.s is the linear MMSB estimator IfTSlI . and is also a scalar. 

In particular, for a measurement s* with variance cr^., and channel output yt = hi - fi{hi,mi) ■ 
Si + Zi at TS i, the decoding function can be written explicitly as: 


_ Es,z[SiYi\ \hi\fi{hi,mi)al^. 


( 2 ) 


In the following lemma we show that there is no loss of optimality by limiting the encoding 
function to be time-invariant. 


Lemma 1. Under a strict delay constraint there is no loss of optimality by considering only 
time-invariant encoding functions, i.e., ffhi^rrii) = f{hi,mi) \/i. 


Proof: 






D — lim — ^M,H,s,z \Si — Si 

n^oo Ti ' ^ 


n 

= lim — E 


i=l 


n—>oo n 


M,H 


i=l 


O'" 


_\h\^fi{h,myal^ + ij ’ 


> Em,h 


cr: 


(3) 

(4) 


\h\^f{h,mYal^ + 1 

where dU) is the average MSE distortion under a striet delay constraint (d = 1); and defining 

n 

f{h,mY = lim„_j.oo 4 ^/j(h, m)^ such that f{h,m) satisfies the average power constraint 


i=l 


P, dH) follows from the convexity of the function Em,h 


in terms of fi{h,mY, 


_\h\^fiih,myal+l_ 

and the equality holds iff fi{h,m) = f{h,m) for Vi and due to the strict convexity of the 
aforementioned function. Thus, the time-invariant encoding function f{h, m), which is a function 
of only h and a^, does not lead to any loss in optimality. ■ 

The time-invariant encoding function f{h,m) leads to a time-invariant decoding function 
g{h, m). In the rest of the paper, we will consider time-invariant encoding and decoding functions 
without loss of optimality. Then, the MSE distortion, D = Em,h,s,z [[S' — and the average 
power, P = Em,h,s^XW can be written explicitly as functions of h and as follows: 


,/ 


D = 

m=l 
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|d|2/(/i,m)V^ + 1 

P I f{h,7nfal^pH{h)dh. 


PH{h)dh, 


(5) 

( 6 ) 


m=l 


The optimal linear encoding function f*{h,m) is found as the solution to the convex op¬ 
timization problem D* = minZ), subject to the average power constraint P < P. Erom the 
Karush-Kuhn-Tucker (KKT) optimality conditions IfT^ . we obtain: 
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A* 

I ^1 


1 

1 ^ 


-I- 


where A* is the optimal Eagrange multiplier, such that P = P. 

The optimal power allocation and the corresponding distortion are given by: 


P*{h,7n) 
D*{h, m) 


L 

a, 


X* 


\h\ 


mm 





(7) 


( 8 ) 

(9) 























Fig. 2. Water-filling reflected on a reciprocal mirror. 


where D* = Ejvf.w [D*{h^m)] and [P*(h,m)] = P. 

In Fig. [2l we present a graphical interpretation of the optimal power allocation and the 
corresponding distortion for J = 2 parameters with variances af and cr|, which are requested 
with probabilities pm(1),Pm( 2), respectively. We also consider a discrete fading channel with 
three states, where the kth state, hk, is observed with probability pnihk), k = 1,2,3. Fig. [2] 
depicts rectangles that are placed upon a mirror surface and their reciprocally scaled images 
below. Rectangles represent all possible source-channel pairs {am, hk}, where Ikm — 

Wkm = 1X7 indicate the height and width of the rectangles, respectively. The total power is 

\'^k \ 

poured above the level Ikm up to the water level A* across the rectangles placed upon the mirror. 
The optimal power allocated to the source-channel pair [am, hk} is given by the shaded area 
below the water level and above Ikm- The corresponding distortion values are found by simply 
looking at the reciprocally scaled reflections of the rectangles and the water level on the mirror. 
If 7 ^- > -T, distortion is given by the width Wkm times the reciprocal of the water level and 
if 7 ^- < -T, distortion is al,, which are illustrated as dashed areas in Fig. [2l We call this as 

f'km ^ 

water-filling reflected on a reciprocal mirror. 

A. Multiple Measurements and Parallel Channels 

Next, we assume that the CC requests > 1 measurements from the sensor at each TS, 
and the sensor transmits a length-iV measurement vector over N parallel orthogonal AWGN 
fading channels under a strict delay constraint (d = 1). For this scenario, we characterize the 
optimal LT strategy by generalizing the results of BH derived for Gaussian vector sources to our 
composite Gaussian measurements model. This scenario differs from the system model defined 
































10 


in Section |nl since we allow to take N measurements at each TS as opposed to taking only one 
measurement at each TS. However, we will exploit the optimal LT strategy in this setting for the 
construction of the proposed transmission strategies in Section |IVl as well as for characterizing 
the LLB in Section IV-BI 

Only for this scenario, we define m = [mi, as the vector of indices of N re¬ 

quested parameters at a particular TS. Then, the sensor takes the length-measurement vector 
s = [si,..., sn] according to the parameters indicated by m, i.e., si is the measured value of 
parameter mi. For a strict delay constraint {d = 1), the optimal LT performance is achieved 
by transmitting only the most recent measurement vector. Similarly to Lemma [H the encoding 
function can be limited to a time-invariant N x N square matrix Fh,m without loss of optimality, 
where subscripts h and m indicate the dependence of the encoding matrix on the realizations 
of h and m. We assume that the N channels are i.i.d with distribution pnih), and H is defined 
as the N X N diagonal channel matrix. The diagonal elements of H are denoted by a channel 
vector h = [hi,..., at a particular TS. The length-iV channel output vector at that particular 
TS is given by y = Hx -f z, where x is the length-iV channel input vector and z is the length-iV 
additive noise vector with z ~ Af{0, 1). 

The encoder at any TS maps its measurement vector s, to a channel input vector x, i.e., 
X = Fh,mS- An average power constraint of P is imposed on the encoding function: 

P = i {Em,«,s|xx^} = i IV {EM,«lFh,„C,Fj„|} < P, (10) 

where Cg = E 5 [ss'^]. 

Given the encoding function, the decoding function that minimizes the MSE for a Gaussian 
random vector is the N x N linear MMSE estimator matrix Gh,m IH?! . which is also time- 
invariant. Similarly to Fh,m> subscripts h and m indicate the dependence of the decoding matrix 
on the realizations of h and m. Eor the measurement vector s, and the channel output vector y, 
at any TS, we have: 

Gh,m = CgyC;^ = (11) 

where Cgy = E 5 ,z[sy^], Cy = Es,z[yy^] and $ ^ (HFh,„.CgF?;„,H^ +1)^1. 

At any TS, the CC estimates the most recent measurement vector s as s, i.e., s = Gh,my- 
The MSE distortion is given by: 




II 



The optimal linear encoding matrix Fj^ is found as the solution to the convex optimization 
problem D* = min D, subject to the average power constraint P < P. For a set of static parallel 
AWGN channels and Gaussian vector sources, the optimal linear encoding matrix transmits one 
measurement over each channel |0 . The optimal mapping between channels and measurements 
is as follows: We first reorder the measurement vector s to obtain s = [s(i),..., S(iv)], such that 
<7^(1) ^ *^^( 2 ) ^ ■ ■ ■ < reorder the channel vector h to obtain h = , h(N)], 

such that I /i(i) I < I /i( 2 ) I < • • • < I \. Then, the optimal linear encoding matrix is diagonal 
with entries m(i) at)(/?,(A r),m(Ar))], and it maps the ordered measurements to 

ordered channel states. In order to find the diagonal entries of F^ we can explicitly rewrite 
the convex optimization problem by using the optimal mappings derived in 0, as follows: 



(13) 


where the expectation is taken over and for t G [PN]. The f-th smallest entry of the 
requested parameter vector m = [mi, m 2 ,..., m^v], is denoted by the r.v. M(t) G [1: J] with the 
order statistics Without loss of generality, we assume that ordering the entries of m in 

ascending order, i.e., m(i) < m( 2 ) < ■ ■ • < implies ordering the entries of the measurement 
vector s in ascending variances, i.e., < • • • < Similarly, the f-th smallest 

entry of the channel vector h = [/ii, / 12 , • • •, is denoted by the r.v. i/(i) G M with the order 
statistics 

The optimal linear encoding matrix F^ with diagonal entries for t G [1:N], 

can be found from the Lagrange and the KKT conditions as follows: 



+ 


( 14 ) 
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where <5* is the optimal Lagrange multiplier, sueh that P = P in (fT3]) . 

Similarly, the optimal power alloeation and the eorresponding distortion ean be found by using 
the water-filling reflected on a reciprocal mirror interpretation. The optimal Lagrange multiplier 
depends on pM^^){rn) and which can be found explicitly by using the order statistics. 

In the following lemma, we give the t-th order statistics pM^^s^{rn) and pH^^^{h), for t G [I'-N]. 

Lemma 2. Let Pjvf(m) and Fnih) denote the cumulative distribution functions of pm{,^) cind 
Pnih), respectively. Given FM^m), pMim), Fuifii), puih) and N, the t-th order statistics 
and pH(t){h), t e [l-'iV], are found as: 

Pn,,,W = - Fn(h) f-‘, (15) 

^ ^ j [PM(m)''(l - FM{rn)Y~^ - FM{m - 1)^(1 - Fm^tti - 1))^"'’] . (16) 

b=t ^ ' 

Proof: The proof is trivial and achieved through the definition of the cumulative distribution 
functions of H(^t) and 


= Pr{P(i) < h} = Pr{at least t of P’s are < h}, (17) 

where (fTVl) implies a binomial distribution with at least t successes and can be formulated as (fTSl) . 
The f-th order statistics PH^^t){h) is found by taking the derivative of (fT^ with respect to h. The 
same proof holds for M(p. m 


IV. LT Strategies 

In this section, we propose two LT strategies for general delay constraints d > 1. The block 
diagram of the proposed LT strategies is illustrated in Fig. [3l Both strategies are composed 
of two main blocks, namely, storage and transmission blocks. There are two buffers of size d 
measurements, namely, the measurement buffer (MB) and the transmission buffer (TB). Here, 
we present these two schemes for an odd delay constraint, i.e., d G {1, 3, 5,...}, but they can 
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Fig. 3. The block diagram illustration of the proposed LT strategies. 


be easily adapted to the case when d is even. In the storage block, given a delay constraint of 
d = 2d — 1 for (J G [l:oo], the sensor collects a block of d consecutive measurements after d 
consecutive TSs, and stores them in the MB. The consecutive blocks of d measurements, taken 
over successive time intervals, are indexed by k = {1,2,...}. Then, the k-th block consists of 
the measurements taken within TSs [(1 + {k — l)d):kd], i.e., When the MB gets 

full with the d measurements of the k-th block, the sensor removes ,t from the MB 
and loads them into the TB. Then, for the next consecutive d TSs [kd:{(k + l)d — 1)], the 
sensor accesses the channel and transmits a linear function of the measurements in the TB, i.e., 
over the channel states satisfying the delay constraint d. The specifics of 

these linear functions will be explained below. 

Note that, while the sensor transmits the measurements in the TB, it starts refilling the MB 
with new measurements After d channel accesses within TSs [kd:{{k + l)d — 1)], the 

MB gets full again and its new d measurements are transferred to the TB for transmission over 
the next d TSs. 

The proposed transmission strategies consist of two sub-blocks, namely, the measurement 
selection and scaling sub-blocks. This division is motivated by the results of (fill presented in 
Section ITTT-A[ in which N ordered measurements are mapped one-to-one to N ordered channels, 
and each measurement is transmitted over its corresponding channel. Hence, we assume that, at 
each channel access, the sensor selects only one measurement and scales it to a channel input 
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value. However, in this ease, we eannot direetly use the optimal LT seheme in @ and guarantee 
that the seleeted measurement and the ehannel state satisfy the optimal matehing. This is beeause 
even though d measurements are available in the TB in advanee, the states of the next d ehannels 
are not available to the transmitter as in the parallel ehannel model of and instead, they 
beeome available over time. The two proposed LT strategies differ in the way they ehoose the 
measurement to be transmitted at each TS. 


A. Linear Transmission Scheme with Hard Matching (LTHM) 

This transmission scheme has the following measurement selection criteria. Assume, without 
loss of generality, that parameters are ordered such that > ct| > • • • > cij. We divide the 
channel magnitude space (M’*') into J ordered channel intervals as, Hm = where 

H:n < for m e [1:^]. The boundary values are chosen as H'q = oo, H'j = 0 and H'^ = 

m 

- Epm(j))> for m e 1 )], where denotes the inverse of the cumulative 

i=i 

distribution function of the channel magnitude \ h\, FH{\h\). Observe that according to this choice, 
the probability of the channel magnitude belonging to Tim is Pr{|/i| G Tim] = PM(m)Ii] 

The algorithmic description of LTHM is given in Algorithm [B Let b = [ 6 i, 62 , • • •, & j] be 
a J-length vector, where the m-th entry, bm G [0:(i], denotes the number of measurements of 
parameter m in the TB, for m G [I'-J]. At each channel access, if \h\ G Tim and bm 7 ^ 0, 
then the sensor selects one measurement of the parameter type m from the TB and feeds it 
to the scaling sub-block. If there are multiple measurements of the same parameter type m in 
the TB, i.e., bm > 1, then the sensor selects one of them randomly. The selected measurement 
is removed from the TB and b is updated by reducing the m-th entry, bm, by one. Thereby, 
each measurement is transmitted only once. On the other hand, if \h\ G Tim and bm = 0, no 
measurement is transmitted in that TS. Hence, LTHM considers a hard matching condition for 
selecting measurements, in which each parameter has a corresponding interval of channel states. 


*If channel fading follows a discrete distribution, we define sets of channel states as opposed to intervals. With abuse of notation, 
we denote the mth set as Hm, for m G Suppose that the discrete channel states are ordered as \hi \ > |/i 2 | > l^isl > • • • • 

We allocate the discrete states into J sets such that the probability of channel state falling into set is pM{m). However, 
it may be possible that the channel states cannot be grouped to satisfy this equality exactly for all m. In that case we create 
virtual states to satisfy these equalities, as explained below. 

Let j be the minimum index for which 'F,i=iPn{\h\ = hi) > pm(1). Define p\f = Pm(1) — 'l2iZi Pn{\h\ = hi). We 
define a new virtual channel state hj , whose gain is equivalent to hj . Whenever the real channel state is hj , we randomly assign 
the channel state to hj with probability p\f /pM{j). We let Hi = {^i,..., hj-i, hj}. We repeat the same process for pm{2), 
starting with channel state hj whose probability is now pnihj) — Pm. 
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and only measurements of that parameter ean be transmitted over a channel state from that 
interval. Note that, since the channel state is known at the receiver, it also knows which type of 
measurement is transmitted at each TS. 

For the scaling sub-block we use the power allocation strategy derived in Section Hill Thus, 
the selected measurement of the parameter type m is transmitted at the current channel state 
\h\ e T-irn, for rn G by allocating power P{h,m), leading to distortion D{h,m): 


P{h, m) 



if hard matching holds, 
otherwise. 


(19) 


if hard matching holds, 

\h?P{h,m)+V S ^20) 

Otherwise, 

where /r is chosen such that the average power constraint is satisfied. 

After every transmission, the CC estimates the transmitted measurement s by using the 
channel output y. It is noteworthy that after d channel accesses, we may have untransmitted 
measurements in the TB. TB is emptied anyway since these measurements have expired, and 
they are estimated with the maximum distortion cr^. As we show next, the average number 
of untransmitted measurements decreases with the increasing delay constraint d. However, for 
a finite delay constraint the untransmitted measurements dominate the distortion even for a 
high average transmission power constraint. In order to combat this drawback, we propose an 
alternative LT scheme. 


B. Linear Transmission Scheme with Soft Matching (LTSM) 

The algorithmic description of LTSM is given in Algorithm [T1 The LTSM retains the hard 
matching condition of LTHM, i.e., at each channel access, if |/z.| G Pm and 6^ 7 ^ 0 for m G [PJ], 
LTSM selects one measurement of the parameter type m from the TB. Hence, LTSM also 
gives the highest selection priority to the measurement of the parameter type that satisfies the 
hard matching condition with the channel state. However, if |/i| G Pm and bm = 0, LTSM 
does not waste the channel state; and instead, selects one measurement based on the following 
measurement selection criteria: 
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Algorithm 1 LTHM and LTSM 


1 : 

2 : 

3: 


4: 


5: 

6 : 


Initialization: 

Load measurements of MB, into TB and update 

for i = kd to {k + l)d — 1 do > 

if \hi\ G T-Lm and bm ^ 0 then > 

Seleet one measurement of parameter m from TB. 

Transmit the measurement over \hi\ with an allocated power of 

bm ^ bfji 1 

else 

Find ^ by solving min 11 h* | — h' | . 

Select one measurement of parameter from TB. 

Transmit the measurement over | hi \ with an allocated power of 

bm ^ bm 1 


b. 

TSs for d channel accesses 
both for LTHM and LTSM 

> measurement selection 

Eqn. (fT9l) . > scaling 

> update b 
> only for LTSM 

> measurement selection 

Eqn. (fT9l) . > scaling 

> update b 


end if 
end for 

k k + 1 and go to Initialization 


Assume that each interval Tim is further divided into two equally probable intervals by the 
boundary value h'm = ^ If \h\ G Tim and bm = 0, 

then LTSM selects one measurement of parameter which is the parameter that minimizes the 
following distance metric: 


min \\h\ — h'\ . ( 21 ) 

I' ' 

When the hard matching condition is not satisfied, the LTSM considers a soft matching 
condition for selecting measurements; that is, among all parameter types of the measurements in 
the TB, it selects a measurement of the parameter whose corresponding interval of channel states 
has the value h'^ closest to the channel state magnitude \h\. If two distinct ^ values satisfy the 
solution of Eqn. (1^ . then LTSM chooses the smallest value of <^. LTSM allocates the power as 
in Eqn. (fT9l) . and transmits the selected measurement, leading to distortion in Eqn. (l20l) . Note that 
the optimal Eagrange multiplier /i is chosen such that the average power constraint is satisfied. 
At the end of d channel accesses, the sensor will have transmitted all the measurements in the 
TB, albeit some might have been allocated zero power as a result of the water-filling algorithm. 

^If the channel follows a discrete fading distribution, we find /iL by taking the mean value of all elements of channel set 
■Hm. 
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V. Distortion Lower Bounds 


We characterize two lower bounds on the MSE distortion, namely, the TLB and the LLB. 
While the TLB is the theoretical performance bound derived without any delay or complexity 
constraints on the transmission, the LLB is a performance lower bound only for LT strategies. 
We also prove that the proposed LT strategies meet the TLB under infinite delay and certain 
matching conditions between the channel states and parameter variances. 

A. Theoretical Lower Bound (TLB) 

Shannon’s source-channel separation theorem states that the optimal end-to-end distortion is 
achieved by concatenating the optimal source and channel codes when there is no delay or 
complexity constraints, and the source and channel distributions are ergodic ifTTll . When we 
remove the delay and linear encoding constraints in our system model, then the sensor can 
transmit to the CC at the ergodic capacity, Ce, of the underlying fading channel, while the 
minimum distortion, Df., is found by evaluating the distortion-rate function for a composite 
Gaussian source model at the ergodic capacity. 

Since the channel state is known by both the transmitter and receiver, the ergodic capacity, in 
terms of the optimal power allocation scheme P*{h), is given by: 


c.aeh iiog(i + |ftfp;(ft)) 


( 22 ) 


where P*{h) is found by the water-filling algorithm as P*{h) = [a* — l/|/;,p] + , where a* is 
chosen to satisfy P^ = Eh [Pe{h)] = P. 

Lrom Eqn. (6.1.21) of [lT4l . the distortion-rate function of a composite Gaussian source with 
m components, AA(0, a^), each of which is observed with probability pM{m) for m G [1:T], is 
defined as: 



(23) 


where the optimal rate allocated to source m, R*(am), and the corresponding distortion, D2{am), 
are given by: 



( 24 ) 
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= min (/?*, a^) , (25) 

where (3* is chosen such that Rf. = [Rl{am)] = C^- 

Hence, the optimal distortion is found as = Em [Dl{am)], which is the TLB on the 
achievable MSB distortion by any transmission strategy. Note that we have removed both the 
delay constraint and the linearity requirement on the encoder and decoder. 

1) Asymptotic Optimality of LT: In general, the TLB cannot be achieved by LT strategies even 
if the delay constraint is removed. However, it can be shown that LTHM and LTSM meet this 
lower bound when the delay constraint is removed under certain matching conditions between 
the channel states and the parameter variances. 

Assume that the channel follows a discrete fading distribution, where the channel state h 
can take one of the J values hm with probability pH{hm) for m G The discrete values 
are ordered as |hi| > \h 2 \ > > \hj\. The next theorem states the necessary conditions 

in this discrete channel model under which LTHM and LTSM achieve the optimal distortion 
performance when the delay constraint is removed. 

Theorem 1. For the discrete AWGN fading channel model, if the parameter variances and the 
discrete channel states satisfy = ■ ■ ■ = and Pm{^) = VHihm), for Vm G [l.'T], then 

|^l| \^j\ 

the TLB is achieved by LTHM and LTSM when the delay constraint is removed, i.e., d ^ oo. 
Proof: The proof can be found in Appendix A. ■ 

B. The Linear Transmission Lower Bound (LLB) 

We next derive a lower bound on the achievable MSB distortion as a function of the delay 
and power constraints for any LT strategy. In order to derive this lower bound, we relax the 
assumption on the causal knowledge of the measurements and channel states, and instead assume 
that the sensor has the offline (non-causal) knowledge of a certain number of future measurements 
and channel states. Accordingly, we assume that at any TS the sensor non-causally knows the 
length-M measurement vector, i.e., s = [si,... taken over the next u TSs. Observe that, 
for a delay constraint d, each measurement of s can only be transmitted over the following d 
channel states observed after it is taken, thus the transmission of the vector s spans the following 
c = (d+u — l) channel states observed after the first measurement si is taken. We further assume 
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that the sensor non-eausally knows the length-c ehannel veetor h = [/ii,..., /ig]- Heneeforth, the 
problem is redueed to optimally transmitting u measurements over c parallel ehannels, whieh is 
attained by using the optimal LT seheme presented in Section ITTT- A 1 Accordingly, we first reorder 
s to get s = ... ,S(^i)], where the variances of the ordered measurements satisfy < 

^ ■ ■ ■ < reorder h to get h = , ^(s)], such that the ordered fading 

states satisfy |/r(i)| < |/r( 2 )| < • • • < |^(c)|- Then, the c x optimal linear encoding matrix 
consists of a -u X -u size diagonal partition with entries [/(i)(/i(i+e), m(i)),..., f(u)ih(u+e),fn(u))], 
and a exit size partition with zero entries, where e = c — u, and it maps u ordered measurements 
to the u channels with the largest gains. The optimal entries of FJ^^^ are found as the solution 
of the following convex optimization problem with the optimal objective function D*{d, u, P ): 


min 

f(t) 




1(t) 


U 

Ly _ 

U y |/l(t+g)P/(t)(/i(t+g),W(t))2cr2 




- ^/(t)(^(t+e)jnr(t)) (T„ 


'(t) 


t=i 


•(t) 


<P, 


( 26 ) 


where the expectation is taken over M(t) and if{t+e) for t £ The f-th and {t + e)-th order 
statistics and are given by Lemma [2l The optimal linear encoding matrix 

with diagonal entries is found as: 


f{€) {^it+e)y 


\ 


l%+e)l 


(Tr) 




\h{t+s)\‘ 


a, 


m(t) 


(27) 


where C* is the optimal Lagrange multiplier, such that P = P in (|2^ . 

Assuming non-causal knowledge of u measurements and c channel states under the delay 
constraint d and the average power constraint P, we obtain the optimal distortion D*{d, u, P) for 
any LT strategy. Then, the LLB is derived by finding the u value, which maximizes D*{d, u, P ): 


A [d, P) = max D*{d,u,P). (28) 

U 

Note that we have relaxed the constraint for the causal knowledge of measurements and 
channel states both at the encoder and decoder. The numerical comparisons of the LLB with the 
proposed schemes will be presented in Section IVIIi 
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VI. No CSI AT THE Encoder 

In this section, we assume that the CSI is known only at the decoder. We derive the optimal 
LT strategy under a strict delay constraint {d = 1), as well as the TLB on the achievable 
MSB distortion. Additionally, for the multiple measurements-parallel channels scenario studied 
in Section 1111- A [ we show that if the CSI is available only at the receiver, any LT scheme that is 
limited to a one-to-one linear mapping from the measurements to the channel input is suboptimal 
in general. The optimal LT strategy is elusive and it will be a non-trivial function of the source 
variances and the channel distribution. 


A. Strict Delay Constraint 


Under a strict delay constraint, the most recent measurement is transmitted at each TS. By 
applying Lemma [T] to this scenario, we can similarly show that there is no loss of optimality 
by considering time-invariant encoding functions, i.e., fi{m) = f{m), Vi Hence, the encoding 
function /(m) is a scalar and time-invariant. The decoding function g{h,m) that minimizes the 
MSB is the linear MMSE estimator ifTSll . and is also a scalar and time-invariant. Then, the MSE 
distortion, D = Em,h,s,z[\^ ~ the average power, P = Em,s[|-AP], can be written 

explicitly as: 


D 

P 


.1 

m=l 


a 


2 

m 




jPH{h)dh, 


m=l 


(29) 

(30) 


where P{m) = f{mYa‘^. The optimal linear encoding function, f*{m), is found as the solution 
to the convex optimization problem D* = min D, subject to the average power constraint P < P. 
Lrom the KKT conditions [|T^ . we have: 


f*{m) = 



(31) 


where T' ^ : M —)■ M is the inverse of the function \I/ : M —)■ R, that is defined as, \I/(P(m)) = 
Ik. {\h\^p^!n)+ip PttW^^- The optimal Lagrange multiplier A* is chosen such that P = P in (l30l) . 
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B. Multiple Measurements and Parallel Channels 

Next we consider the multiple measurements-parallel channels scenario studied in Section InT-Al 
under the strict delay constraint and the assumption that the CSI is known only at the decoder, 
and J > 1. In such a scenario, the optimal LT scheme of O, in which the ordered measurements 
are mapped one-to-one to ordered channel states, cannot be used directly. This is because, even 
though the encoder knows the N measurements, it does not know any of the channel states, and 
hence; cannot order them. For the special case where N measurements are observed from a single 
Gaussian source (J = 1), in |I71 the authors show that the optimal performance is achieved by 
transmitting one measurement over each channel. When J = 1, since N measurements all have 
the same variance, all orderings are equivalent, and the optimal LT performance is achieved 
by an LT scheme that uses only a one-to-one mapping between measurements and channels. 
However, this is not the case in general when J > 1. Since N measurements follow a composite 
Gaussian source model, the encoder can have measurements with different variances; and hence, 
we can exploit the diversity of the fading channel by transmitting a single measurement over 
multiple channels, instead of transmitting each measurement only once. Depending on the source 
variances, the former may surpass the best LT performance achieved by using only a one-to-one 
linear mapping. This is shown in the following lemma by considering a particular example. 

Lemma 3. Consider the LT of N measurements of a composite Gaussian source with J > 1 
components over N parallel AWGN fading channels. If the CSI is known only by the decoder, 
then the LT scheme that uses a one-to-one linear mapping between measurements and channels 
is suboptimal in general. 

Proof: The proof can be found in Appendix B. ■ 

C. The Theoretical Lower Bound (TLB) 

Similarly to Section IV-Al we derive the TLB on the achievable MSE distortion by using 
Shannon’s source-channel separation theorem. If the CSI is available only at the decoder and 
the average power constraint is P, then the ergodic capacity is given by: 


a>E 


H 


log(l+ \h\^P) 


(32) 
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Fig. 4. Achievable MSB distortion with LTFIM with respect to average power for different delay constraints in the discrete 
fading channel model. 


The distortion-rate funetion of a composite Gaussian source is defined as in Eqn. (|2^ of 
Section 1^^ which leads to the optimal rate allocated to source m, as in Eqn. (|2^ and 

the corresponding distortion, D*{am), as in Eqn. (1251) . respectively. The Eagrangian multiplier 
l3* for this case is chosen such that [R*{am)] is equal to the ergodic capacity Ce in (1^ . 
Then the TEB on the achievable MSE distortion by any strategy when the encoder does not 
have the CSI is given by [Dl{am)]- 

VIE Numerical Results and Observations 

Here we provide numerical results to compare the performances of ETHM and ETSM with the 
lower bounds, and to analyze the impact of the delay and power constraints on the performance. 
In our simulations, we consider J = 4 Gaussian parameters with variances {10, 5,1, 0.5}, which 
are requested with probabilities {0.1, 0.3, 0.4, 0.2}, respectively. Eor a continuous fading channel, 
we consider Rayleigh distribution with a scale parameter cu = 3, and for a discrete fading 
channel, we consider four states {\/T0, \/5,1, \/(h5}, which are observed with probabilities 
{0.1,0.3,0.4,0.2}, respectively. 

We illustrate the achievable MSE distortion versus average power under various delay con¬ 
straints with ETHM in the discrete channel setting in Eig. IH We observe that the MSE distortion 
diminishes as the delay constraint is relaxed. This is because a relaxed delay constraint provides a 
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larger number of measurements in the TB; and henee, more flexibility for the sensor in seleeting 
the appropriate measurement for eaeh TS. We note that this statement does not hold when J = 1, 
in whieh ease inereasing the bloek length does not provide any improvement As it ean be seen 
in Fig. m the MSE distortion eonverges to a fixed value as the average power value inereases. 
This is due to the additional distortion brought in by the untransmitted measurements in the 
TB. The average number of untransmitted measurements and their effeet on the MSE distortion 
deereases as the delay eonstraint is relaxed, sinee having a larger number of measurements in the 
TB inereases the probability of finding a measurement that satisfies the hard matehing eondition. 
In partieular, when the delay eonstraint is removed, as seen in Eig. |4l ETHM aehieves the TEB, 
and beeomes the optimal ET seheme, sinee the souree-ehannel matehing eonditions in Theorem [T| 
are satisfied for the setup eonsidered here. 

In Eig. |5l we illustrate the aehievable MSE distortion with ETSM with respeet to average 
power under various delay eonstraints in the eontinuous ehannel model. Similarly to ETHM, the 
MSE distortion diminishes as the delay eonstraint inereases. On the other hand, as opposed to 
ETHM, the MSE distortion aehieved by ETSM deereases monotonieally with the average power 
as illustrated in Eig. |51 This is beeause the performanee of ETSM does not suffer from a fixed 
distortion eomponent due to the untransmitted measurements. In addition, ETSM also approaehes 
the TEB as the delay eonstraint is relaxed. Although we do not expeet the ETSM to meet the 
TEB in this setting sinee the matehing eonditions of Theorem [T| do not hold, we observe in 
Eig. [5] that it is very elose to the TEB. 

Next, we eompare the performanees of ETHM and ETSM with eaeh other and with the TEB 
and the EEB. Eig. |6] shows the aehievable MSE distortion of ETHM, ETSM, the EEB and the 
TEB with respeet to delay eonstraint in the eontinuous fading ehannel model for an average 
power eonstraint P = 10 dB. As seen in the figure, the performanee of the TEB is eonstant 
sinee it is derived by eompletely removing the delay and eomplexity eonstraints. On the other 
hand, the EEB deeays slowly as the delay eonstraint inereases. As expeeted, the MSE distortion 
of ETHM and ETSM deerease as the delay eonstraint inereases. We ean see that ETSM meets the 
EEB under the striet delay eonstraint. As expeeted, ETSM always outperforms ETHM, while the 
gap between the two sehemes deereases with the inereasing delay eonstraint. The gap between 
the TEB and two sehemes also deereases with the inereasing delay eonstraint even though we 
do not expeet either of the sehemes eonverge to the TEB in this setting sinee the matehing 
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Fig. 5. Achievable MSB distortion with LTSM with respect to average power for various delay constraints in the continuous 
fading channel model. 



Fig. 6. MSB distortion versus delay constraint, d, in the continuous fading channel model for an average power constraint 
P = 10 dB. 

conditions of Theorem [T] do not hold. 

Finally, in Fig. |71 we illustrate the aehievable MSB distortion of LT and the TLB with respect 
to average power in the diserete ehannel model for the seenarios in whieh the CSI is known 
only by the deeoder, and by both the eneoder and deeoder. The MSB distortion of LT under 
strict delay eonstraint of d = 1 for both seenarios diminishes as the average power increases. 
However, there is a eonstant gap between the optimal performanees aehieved with and without 
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Fig. 7. The achievable MSB distortion of LT and the TLB with respect to average power in the discrete fading channel model 
with and without encoder CSI. 


encoder CSI at higher P values. On the other hand, the TLB for both scenarios meet as the 
average power increases since the gain from the optimal power allocation over different channel 
states disappears in the high power regime. 

VIII. Conclusions 

We have studied the delay-constrained LT of composite Gaussian measurements from a sensor 
to a CC over an AWGN fading channel. We have considered a wireless sensor that can collect 
measurements from J distinct Gaussian parameters. The CC asks for a measurement of a 
particular parameter from the sensor with a certain probability at each TS. In this framework, 
we have presented the optimal LT strategy under a strict delay constraint, and have given a 
graphical interpretation for the optimal power allocation scheme and the corresponding distortion 
value. Then, we have proposed two LT strategies, called LTHM and LTSM, under general delay 
constraints, and have provided numerical results to investigate the impact of the delay and average 
power constraints on the performance. We have seen that, if the number of parameters, J, is more 
than one, the MSE distortion decreases as the delay constraint is relaxed. We have also derived 
lower bounds on the achievable MSE distortion for generic and ET strategies. While ETSM 
outperforms ETHM at all delay constraints, we have shown analytically that both strategies 
meet the lower bound when the delay constraint is removed, under certain matching conditions 
between the parameter and the channel statistics. 
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We have also studied the seenario in whieh the CSI is known only by the deeoder. We have 
presented the optimal LT strategy under a striet delay eonstraint. We have derived a TLB on 
the aehievable MSB distortion by relaxing the delay eonstraint and the linearity requirement. 
We have also eonsidered the multiple measurements-parallel ehannels seenario under a striet 
delay eonstraint, and have shown that the optimal LT performanee eannot be aehieved by using 
only a one-to-one linear mapping between measurements and ehannels, as opposed to the results 
derived in [|6]| and [|3- The design of the optimal LT strategy for the multiple measurements- 
parallel ehannels seenario for arbitrary delay eonstraints is elusive, and is left as future work. 

Appendix A 
Proof of Theorem 1 

Given a delay eonstraint d = 2d — 1, let the r.v. m G [1:T], denote the total number of 
measurements of parameter m among d measurements loaded into the TB. follows a Binomial 
distribution with parameters d and puim). Henee, the probability of having k measurements 
of parameter m in the TB is given by Pz^{k) = Pr{Zm = k} = — PM{jn)Y~^. 

Similarly, eonsidering the diserete fading model presented in Seetion lV-Al[ let the r.v. Z^, m G 
[1: J], denote the total number of ehannels with state hm, after d ehannel aeeesses. Zm also follows 
a Binomial distribution with parameters d and pnikm)- Henee, the probability of observing k 
ehannels with state hm is given by Pz^{k) = Pr{Z^ = k} = (f)pr/(hm)^(l - 

Observe that after d ehannel aeeesses, the number of transmitted measurements seleeted from 
the TB with LTHM is given by mm{Zm, Z^}- On the other hand, the number of untransmitted 
measurements remained in the TB is given by [Zm — ZmY- Then, the average power, Poo, and 
the aehievable MSB distortion, Poo, of LTHM when d —)■ cxd are given by: 


Poo=lim^y^E2 A 
d^ocd ^ 

m=l 


min s Zmj Z„ 


PChm,m), 


(33) 


1 


m=l 




O'm + Ey 


^ ZmiZm 


min {Zm,Zm 


D{hm,m)\, (34) 


where the alloeated power P{hm,'rn) and the distortion D{hm,'m) are ehosen as in Bqn. (fT9l) 
and Bqn. (l20l) . respeetively. 
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In the rest of the proof, we use p{m) to refer to the eondition of Theorem [H i.e., = 

Puihm) = pim), Vm. Under this eondition, the expeeted value and varianee of and ean 
be found as, E[Zm] = E[Zm] = d-p{m) and Var[Zm] = Var[Zm] = cr|^ = d-p{m) ■ (1 —p{m)), 
respeetively. Let e > 0 be any positive number. Then, the Chebyshev’s inequality leads to the 
following inequalities, Pr{|Zm-(i-p(m)| > e-cx^^} < ^ and Pr{|Zm-d-p(m)| > e-az^} < ji- 
We define the interval X on the real line as, I = [d ■ p{m) — e ■ cz^^d ■ p{m) + e ■ <JzJ\- 

Next, we eompute (1^ and (l34l) by finding upper and lower bounds on the expeetation terms 
under the matehing eondition. Observe that, 

< Jim 2 [Zm]=p{m). (35) 

d^oo d ^ 

We ean also lower bound this term as. 


lim 

I /-7 ^vni^rx 

n —irv~i fl, 


min 


]im —E -7 

d—>-oo d 


iZm 


min \Zm,Z„ 


> lim — 


min ^ 


2 


7 J Zm 1 Z\ 
d^oo (1 

^p(m)(l — p{m)) 




= ]im I p{m) — 

d—^oo 




1-^) =p(m), 

as 


(36) 

(37) 

(38) 

(39) 


where (1371) follows from the law of total expeetation; (1381) follows from the definition of X, and 
the Chebyshev’s inequality; and (l39l) is obtained by setting e = d^. Sinee the upper and lower 
bounds in (l35l) and (l39l) are equal, we have shown that (l35l) eonverges to p{m) as d —)■ cx). 
Similarly, 
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lim y 

7 

a—>-00 (L 


= lim Ey y 

d-,00 d ’ 


[Zm — Zm\ 


[Z. 


(40) 


Zm 

Zm 




+E 


Zm \Zri 


{Zr. 


Zm^X 

or 

Zm^X 


PrjZm^X or Zm0^} 


< ]im ^<j 2ecTz„ +{— + —) 

d —^00 CL 


= lim 

d^oo 


2^^/p{m){l — p{m)) 


dt 


+ 


= 0 , 


(41) 

(42) 

(43) 


2 1 
—2 4“ _4 
da ds 

follows from the from the definition 


where (|4TI) follows from the law of total expeetation; 
of X, and the Chebyshev’s inequality; and (1^ is obtained by setting e = d^. This proves 
that (l40l) indeed eonverges to zero as d —)■ cxd. This also implies that as d —)■ 00 , all seleeted 
measurements by the LTSM strategy satisfy the hard matehing eondition. Henee, LTSM and 
LTHM are equivalent in the asymptotie of d ^ 00 under the matehing eondition of Theorem [T] 
Finally, we ean rewrite Poo and Doo for both LTHM and LTSM as: 

7 ^ 


P = 

^ 00 


5:[. 

m=l 

J 

Doo = ^ 


/i q 


\K 


p(m), 


2=1 l\h. 




p*q — 




+ Ij 


p{m), 


(44) 

(45) 


where we use q = Vm, from Theorem [H and p* is ehosen to satisfy Poo = P- 

\hym\ 

Next, we show that (Poo, Poo) pair above, obtained under the eonditions of Theorem [U aehieve 
the TLB pair (Pe, Do), derived in Seetion IV-AI First, under the matehing eondition, observe that 
p*q = a*, and thus, Poo = Pe = P- Moreover, under the matehing eondition, = Co in TLB im- 
plies a* = ^. Combining the two equalities, we obtain p* = Substituting this into Eqn. (1231) 

together with the matehing eondition, we ean show that Z)e = X) p(^) = Poo, 

whieh eoneludes the proof of Theorem [T] 
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Appendix B 
Proof of Lemma 3 


In order to prove Lemma [H we eonstruct a counter-example. We argue that the achievable MSE 
distortion of a particular LT scheme that is not constrained to use only a one-to-one mapping 
between measurements and channels can be smaller than the minimum achievable MSE distortion 
of all possible ET schemes that use only a one-to-one mapping, i.e., a diagonal encoding matrix. 
Suppose we have J = 2 zero-mean Gaussian parameters with variances af and cr^, which are 
requested with probabilities Pm(1) = Pi and Pm{2) = p 2 = (1 — Pi), respectively, and assume 
an extreme case, where af > 0 and al = 0. Suppose we have a discrete fading channel with 
two states, which are observed with probabilities PHi{hi) = pi and ^^ 2 (^ 2 ) = P 2 , respectively, 
and assume that the channel states are hi > 0 and h 2 = 0. We aim at linearly transmitting 
N = 2 measurements of parameters mi E [1:2] and m 2 E [1:2], over N = 2 channel states 
hi E {hi, 112 } and h 2 E {hi, 112 }. 

We first characterize the minimum achievable MSE distortion, Di, for all possible ET schemes 
with a diagonal encoding matrix. According to Eqn. (fTOl) . the encoding function needs to satisfy 
the average power constraint P, i.e., ^ [PnPi -f P 12 P 1 P 2 + P 21 P 1 P 2 + P 22 PI] = P, where Pmim 2 
is the allocated power for the pair of measurements of parameters mi and m 2 , respectively. We 
have P 22 = 0, since = 0. Then, by using Eqn. (O, the MSE distortion Di can be written 
explicitly as follows: 


i PI E 


Hi 


err 


|h] 


pAi I 1 
I 2 ^ 


+ E 


H 2 






\2P11 I 1 
I 2 ^ 


+P1P2 Phi 


erf 


= Pi\Pi 


[\hi\^Pl2 + l\ 


+ E. 


\hi\'^^+l 


Hi 


+ P2Ht]+^-^\pi 


erf 


VWP 21 + I. 



\hi\^Pl 2 + l 




(46) 

(47) 


where the minimum distortion is achieved by dividing the power, i.e., Pn, equally between 
measurements if two measurements are observed from parameter 1, i.e., mi = m 2 = 1. If one 
measurement is requested from each parameter, i.e., (mi = l,m 2 = 2) or (mi = 2 ,m 2 = 1), 
then the minimum distortion is achieved by allocating the entire power, i.e., P 12 or P 21 , to the 
measurement of parameter 1, since cr| = 0. 

Assuming the average power constraint P is satisfied as in the above scheme, we next consider 
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a particular LT scheme. This scheme uses a diagonal encoding matrix if both measurements 
are observed from the same parameter; otherwise, it uses a non-diagonal matrix, where the 
measurement of parameter 1 is transmitted over two ehannels. Then, from Eqn. (fT^ . the MSE 
distortion D 2 ean be written as follows: 


D 2 = Eh, 

+Ehi,H2 




+ E 


H 2 




+ P1P2 Eh,,H2 


(|/llP + |/l2|2):^ + l 


(|fel|2 + |,,2p)i21+l 


(48) 


— ^2 




+ 2piP2jj^ 


‘-I-1 


(49) 


where the minimum distortion ean be aehieved by dividing the power, i.e., Fn, equally between 
measurements if two measurements are observed from parameter 1, i.e., mi = m 2 = 1, similarly 
to the above scheme. If one measurement is requested from eaeh parameter, i.e., (mi = 1, m 2 = 
2) or (mi = 2 ,m 2 = 1), then this partieular seheme divides the power, i.e., P 12 or P 21 , equally 
between two ehannels hi and h 2 for the transmission of the measurement of parameter 1, as 
seen in the term multiplied by pip 2 in (1481) . If two measurements are observed from parameter 
2, i.e., mi = m 2 = 2, then we do not alloeate power, i.e., P 22 = 0, since = 0. 

We ean easily see that D 2 < Di for all Pn, P 12 and P 21 . This implies that the minimum 
achievable MSE distortion of ET sehemes constrained to one-to-one mapping can be improved 
by utilizing non-diagonal encoding matrices, whieh eoneludes the proof of Eemma[3l 
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